Combining Information Retrieval Methods to Improve Automated Traceability Recovery

نویسندگان

  • Malcom Gethers
  • Rocco Oliveto
  • Denys Poshyvanyk
  • Andrea De Lucia
چکیده

The intensive human effort needed to manually manage traceability information has increased the interest in utilizing automated traceability recovery techniques. Promising results have been achieved using Information Retrieval (IR) techniques to identify candidate traceability links between different types of artifacts—e.g., requirements, use cases, code classes. Different IR methods have been proposed to support such a task but until now there is no single method that sensibly outperforms the others. In this paper we propose to combine different IR techniques aiming at improving the accuracy of stand-alone IR-based traceability recovery methods. The results of a case study conducted on two software repositories demonstrate that the combination of one of the IR methods experimented with the others provides statistically significant improvements as compared with stand-alone methods. The nature of the two repositories used in the case study also allows us to investigate the influence on the recovery accuracy of the language and the type of artifacts to be traced.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recovery of Traceability Links between Software Documentation and Source Code

An approach for the semi-automated recovery of traceability links between software documentation and source code is presented. The methodology is based on the application of information retrieval techniques to extract and analyze the semantic information from the source code and associated documentation. A semi-automatic process is defined based on the proposed methodology. The paper advocates ...

متن کامل

Université De Montréal Analysing Source Code Structure and Mining Software Repositories to Create Requirements Traceability Links

Traceability is the only means to ensure that the source code of a system is consistent with its requirements and that all and only the specified requirements have been implemented. During software maintenance and evolution, as developers add, remove, or modify features (including bugs), requirement traceability links become obsolete because developers do not/cannot devote effort to update them...

متن کامل

Applying a smoothing filter to improve IR-based traceability recovery processes: An empirical investigation

Context: Traceability relations among software artifacts often tend to be missing, outdated, or lost. For this reason, various traceability recovery approaches—based on Information Retrieval (IR) techniques—have been proposed. The performances of such approaches is often influenced by “noise” contained in software artifacts (e.g., recurring words in document templates or other words that do not...

متن کامل

Do Information Retrieval Algorithms for Automated Traceability Perform Effectively on Issue Tracking System Data?

[Context and motivation] Traces between issues in issue tracking systems connect bug reports to software features, they connect competing implementation ideas for a software feature or they identify duplicate issues. However, the trace quality is usually very low. To improve the trace quality between requirements, features, and bugs, information retrieval algorithms for automated trace retrieva...

متن کامل

New Ideas and Emerging Results Track: a Combination Approach for Enhancing Automated Traceability

Tracking a variety of traceability links between artifacts assists software developers in comprehension, efficient development, and effective management of a system. Traceability systems to date based on various Information Retrieval (IR) techniques have been faced with a major open research challenge: how to extract these links with both high precision and high recall. In this paper we describ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010